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Preface 

The  application  of  differential  game  theory  to  the 
analysis  of  aerial  combat  offers  the  fascinating  prospect 
of  evaluating  weapon  systems  and  combat  tactics  against 
an  intelligent,  evasive  opponent.  Unfortunately,  it  seems 
the  more  complex  the  model,  the  harder  it  is  to  maintain 
validity.  Consequently  differential  gaming  has  not  yet  made 
a  substantial  impact  an  realistic  combat  analysis. 

One  of  the  limiting  factors  has  been  that  each  player 
has  had  fixed  roles  of  either  pursuer  or  evader.  This  work 
is  the  result  of  my  efforts  to  evaluate  a  method  which  allows 
for  a  change  of  roles.  The  study  is  based  on  a  general  pur¬ 
pose  payoff  first  suggested  in  the  Doctoral  Dissertation  of 
Major  Urban  H.  D.  Lynch,  then  at  the  Air  Force  Flight  Dy¬ 
namics  Laboratory.  Although  both  the  game  model  and  the 
constant  speed,  horizontal  dynamic  model  are  really  too 
simple,  a  particularly  fascinating  result  was  trajectories 
that  correspond  to  realistic  scissors  maneuvers. 

I  am  very  grateful  to  my  advisor.  Professor  Gerald 
M.  Anderson  of  the  Air  Force  Institute  of  Technology,  whose 
knowledge  and  advice  were  invaluable.  I  am  also  grateful 
to  many  of  my  fellow  students,  in  particular  Captain  Robert 
D.  Powell,  without  whose  practical  experience  of  the  problem, 
I  would  have  been  fishing  in  murkier  waters. 
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Abstract 

The  problem  investigated  is  a  method  of  role  determination, 
using  a  general  purpose  payoff,  in  a  differential  game  model 
of  aerial  combat.  Role  determination  it  taken  to  rean  a 
combatant's  selection  of  a  combat  objective,  based  on  his 
relative  advantage. 

The  class  of  game  is  two  person,  free  time,  zero  sum 
and  perfect  information.  The  aircraft  dynamics  are  restricted 
to  constant  speed  in  the  horizontal  plane.  The  controls  for 
both  players  are  limited  turn  rates.  A  fixed  weapon  range 
establishes  a  terminal  cfnttraint  around  one  player.  Using 
limits  on  the  payoff,  a  game  of  kind  is  formulated  to  give 
successful  termination  at  weapon  range  for  both  piayers.  The 
ptyoff  is  based  on  angular  terminal  requirements,  and  allows 
for  a  variety  of  weapons.  Closed  form  solutions  are  used  to 
find  the  solution  in  *r-  '.are*,  oy  constructing  trajectories 
and  surfaces  backwards  from  cost  criteria  on  the  terminal 
surface. 

A  complete  solution  is  not  obtained,  but  the  existence 
of  isolated  singular  arcs  for  this  type  of  game  is  confirmed, 
and  progress  is  mada  towards  fully  partitioning  the  gime 
space.  Many  resulting  trajectories  correspond  to  realistic 
scissors  maneuvers,  and  a  method  of  partial  role  determination 
is  proposed,  using  suck  trajectories  to  indicate  regions  of 
advantage. 


« 


x 


GA/MC/74- 1 


I .  Introduction 

Experience  over  the  last  decade  has  re-emphasized 
the  importance  of  aerial  combat  in  air  warfare.  This  has 
resulted  in  a  number  of  different  efforts  to  improve 
the  design  and  operation  of  combat  aircraft  and  their  weapons 
by  realistically  modelling  the  combat  situation. 

The  energy  maneuverability  approach  is  useful  for 
reaching  and  maintaining  a  generally  advantageous  position, 
though  with  no  guidance  as  to  what  to  do  with  that  position 
(Ref  4).  More  extensive  optimisation  techniques  overcome 
this  drawback  to  some  extent,  but  are  limited  *o  a  restricted 
target  model  (Ref  2).  Computer  simulations  have  been  designed 
to  combine  energy  considerations  with  certain  established 
guidance  xaws  and  role  logic  (Ref  12).  Using  this  method 
realistic  encounters  resutc,  and  useful  trade  off  studies 
are  possible,  though  they  are  limited  by  the  set  of  simulation 
rules  used.  Manual  combat  simulators  are  powerful  tools, 
combining  ''seat  of  the  pants"  and  scientific  analysis;  they 
are,  however,  highly  subjective.  At  completely  the  opposite 
end  of  the  spectrum  is  gaming,  and  in  particular,  continuous 
(differential)  gaming  (Ref  9). 

The  potential  advantage  of  gaming  is  the  introduction 
of  active  opponents  into  a  completely  analytical  model.  How¬ 
ever,  in  application  tv  realistic  problems,  success  nas  been 
limited.  One  limitation  has  been  that  many  applications  have 
used  a  pursuit-evasion  game,  where  one  player  is  a  pursuer, 
and  the  other  is  an  evader.  The  pursuer's  aim  is  to  destroy 
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the  evader,  whose  ain  is  to  escape.  The  roles  of  each  player 
are  fixed  throughout  the  game.  Realistically  however,  in 
combat  between  relatively  similar  aircraft  and  weapon  systems, 
each  wishes  to  destroy  his  opponent.  Allowance  must  be  made 
for  this  in  a  game  analysis. 

The  purpose  of  this  investigation  is  to  allow  for 
a  change  of  roles  bv  using  a  general  purpose  payoff.  The 
basic  game  model  is  zero  sum,  two  person,  free  time,  and 
perfect  iefornation.  The  dynamics  are  restricted  to  a  planar, 
constant  velocity  model,  in  which  each  player  has  a  limited 
turn  capability. 

The  problem  is  discussed  it.  Chapter  II.  The  origins 
and  nature  of  the  problem  are  related  to  general  ideas  of 
role  determination,  and  the  thesis  objective  is  stated.  Game 
and  dynamic  models  are  established,  with  accompanying  assumptions 
and  constraints.  Finally  the  payoff  is  discussed,  and  general 
criteria  for  determining  the  outcome  of  the  game  are  set  un. 

Chapter  III  formulates  the  game  fully:  the  payoff  is 
modified  to  the  game,  and  the  necessary  conditions  then  ap¬ 
plied  to  the  problem. 

In  Chapter  IV  closed  fora  solutions  to  the  game  are 
derived  for  arcs  of  constant  control,  and  for  game  surfaces 
located  at  arc  junctions. 

In  Chapter  V  the  closed  form  solutions  are  used  to  find 
a  complete  solution  in  the  large. 

Chapter  VI  discusses  the  role  determination  problem  ' 

2 


GA/MC/74- 1 


general,  and  in  relation  to  the  results.  Some  comparison 
with  other  game  models  is  made. 

It  is  felt  this  study  makes  a  definite  contribution 
to  the  differential  game  modelling  of  ’erial  combat.  The  use 
of  the  Lynch  payoff  in  the  game  results  in  closed  form  solutions, 
which  generate  trajectories  corresponding  to  practical  scissors 
maneuvering.  Although  complete  analysis  awaits  a  full  solution, 
significant  extensions  are  made  to  the  knowledge  about  this 
type  of  game  and  payoff.  Finally,  a  method  is  proposed  for 
determining  regions  of  advantage  and  disadvantage  to  either 


combatant . 
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1 1 .  Discussion  of  the  Problem 

Origins  of  the  Problem 

The  problem  considered  here  is  an  extension  of  work 
carried  out  at  the  USAF  Flight  Dynamics  Laboratory  in  the 
use  of  differential  gaming  to  model  air-to-air  combat. 
Specifically,  it  is  a  continuation  of  work  on  a  general 
purpose  payoff  begun  by  Lynch  in  his  Doctoral  Dissertation 
(Ref  9:174) . 

Nature  of  the  Problem 

A  serious  limitation  in  many  of  the  results  of  Reference 
9  is  that  the  aim  (objective)  of  each  aircraft  (player)  is 
fixed  throughout  che  game;  usually  one  aircraft  is  trying  to 
escape,  and  the  other  to  capture.  Some  useful  indications 
of  the  form  and  sensitivity  of  escape  and  capture  regions 
have  resulted  from  this  work.  However,  the  optimal  control 
sequences  (strategies)  produced,  sometimes  differ  considerably 
from  those  expected  from  experience,  thus  questioning  the 
validity  of  that  approach.  For  example,  the  slower  machine, 
unless  considerably  more  maneuverable  than  its  opponent,  will 
always,  under  optimal  play,  be  captured.  Thus  the  slow  evader 
will  invariably  turn  away  from  the  pursuer  until,  at  some 
stage,  he  flies  directly  away,  and  is  then  caught  directly 
from  the  rear  (Ref  9:80)1  In  actual  combat,  the  objectives 
may  change.  Assuming  the  performance  of  each  aircraft  and  its 
weapon  system  is  in  roughly  the  same  class,  then  usually 
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each  combatant  is,  at  some  stage,  hoping  for  success.  Thus 
the  strategies  arc  not  necessarily  ones  of  pure  pursuit  or 
escape  and  should  reflect  an  aggressiveness  by  each  combat¬ 
ant  . 

Based  on  Baron's  approach  to  this  "dogfight"  situation 
(Ref  3:55),  the  problem  of  serial  combat  is  considered  to 
consist  of  two  main  parts: 

1.  Given  the  states  of  two  players,  determine  the 
role  ot  each.  Here  role  determination  is  taken  to  mean:* 

a.  Which  player  has  an  advantage,  and  to  what  extent 

b.  Based  on  this  relative  advantage,  what  is  the 
objective  of  each  player? 

2.  Given  the  roles,  what  are  the  optimal  strategies 
for  each  player? 

Of  the  two  major  parts,  role  determination  is  the  unique  and 
vital  aspect  of  the  combat  problem.  Other  work  has  been  done 
on  role  determination  (Refs  10,  11);  the  important  difference 
here  is  that  the  payoff  emphasizes  the  relative  angles 
between  the  combatants  at  termination. 

Thesis  Objectives 

The  primary  objective  is  to  study  the  validity  of 
using  the  general  purpose  payoff  in  role  determination.  The 
secondary  objective,  although  to  a  large  extent  inseparable 
from  the  first,  5-  to  develop  a  technique  for  using  this  pay¬ 
off  to  analyze  fully  simple  problems,  and  to  form  a  base  for 
the  addition  of  more  realistic  aircraft  dynamics. 


Basic  Game  Model 

The  combat  is  modeled  as  a  two  person  (aircraft),  free 
time,  zero  sum,  perfect  information  game.  A  zero  sum  game 
implies  pure  competition,  and  although  combat  games  aTe 
generally  regarded  as  being  in  this  class,  they  need  not 
necessarily  be  so.  For  example,  if  twc  opponents  are  both 
seeking  to  close  with  each  other,  they  may  both  co-operate 
to  minimize  the  tine  taken  to  do  so.  Thus  a  more  realistic 
model  should  be  partly  co-opeTative  (i.e.  non  zero  sum). 

Games  of  that  type  are  considerably  more  complicated,  and 
thus  it  is  felt  that  results  based  on  a  zero  sum  model  are 
a  necessary  first  step.  The  information  assumption  is 
unrealistic  under  any  circumstances:  however,  it  is  felt 
that  obtaining  a  workable  technique  based  on  perfect  in¬ 
formation  is  a  necessary  simplification  in  the  initial  in¬ 
vestigation  of  the  payoff. 

Dynamic  Model 

Ir.  modeling  the  aircraft  dynamics,  the  common  assumption 
of  a  point  mass,  a  flat  earth,  and  constant  gravity  are  made. 

In  addition,  the  speed  and  altitude  of  each  aircraft  is  also 
held  constant.  The  last  two  are  very  limiting,  but  work  has 
indicated  that  including  these  as  variables  in  the  formulation 
increases  the  number  of  dimensions  to  a  point  that  makes  in¬ 
terpretation  of  results  vary  difficult.  (Refs  6:47,  9:11S). 

These  references  also  indicate  that  for  a  given  game  formulation, 
simple  dynamic  models  give  a  fait  indication  of  whet  to  expect 
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from  more  complicated  dynamics  (Refs  6:46,  9:193). 

State  and  Control  Variables 

The  above  assumptions  result  in  a  six  dimensional 
unbounded  game  space  consisting,  for  each  aircraft,  of  two 
position  co-ordinates  in  a  constant  altitude  horizontal  plane, 
and  a  heading.  Six  dimensions  lead  to  relatively  straight¬ 
forward  dynamic  eouations  ;  however,  the  greater  the  number 
of  variables,  the  more  difficult  it  becomes  to  interpret  the 
-esults.  What  is  required  for  interpretation  is  as  few 
state  variables  as  possible,  having  close  correspondence 
to  the  states  which  pilots  actually  use.  Kith  a  relative 
co-ordinate  system  on  one  aircraft,  the  number  of  states 
is  reduced  to  three. 

The  aircraft  are  designated  A  and  B,  and  a  body  fixed 
otating  co-ordinate  system  is  located  on  A,  with  its  X  axis 
in  the  direction  of  the  longitudinal  axis  through  A's  nose. 

In  this  dynamic  model  this  direction  corresponds  to  the  dir¬ 
ection  of  A's  velocity  at  a  given  time.  As  shown  in  Fig.  1, 
three  independent  state  variables  remain.  The  state  vectors 
used  are  (x(t),  y(t),  t(t)}  and  (r(t),  8(t),  i(t)},  and  it  is 
convenient  to  use  both  in  the  development. 

The  controls  are  the  rates  of  turn  of  A  and  B,  UA  (t) 
and  U_  (t),  respectively.  Each  is  assumed  to  be  constrained 

O 

In  either  direction  by  a  constan  maximum  turn  rate, 
and  UgMAX *  resP0Ctlvely.  To  be  realistic,  a  model  should 
include  instantaneous  and  sustained  nax-rate  capability. 
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where  «  speed  of  A 

Vg  *  speed  of  B 

and  both  and  are  constant. 


The  Terninal  Constraint 


The  conditions  needed  for  a  successful  conclusion 
by  either  A  or  B  (and  conversely  failure  for  the  other) 
depend  largely  on  their  respective  veaooo  systems.-  The 
most  common  condition  used  to  end  games  is  a  fixed  final 
range  from  the  target.  In  the  pl3rar  case,  this  generates 
a  circle  of  radius  L  (the  weapon  range)  in  the  xy  plane 
around  one  of  the  combatants  (Fig.  1,  page  8).  Other  approaches 
have  been  used  to  create  .aare  realistic  terminal  conditions: 

■ore  complex  areas  such  as  elliptical  kill  probability- 
regions  rotating  behind  each  combatant  (Ref  6:7)  ,  and  simple 
ones  such  as  "gun  spikes"  pointing  directly  forward  of  each 
aircraft  (Ref  10).  Use  of  ellipliial  regions  in  the  role 
problem  requires  two  such  areas  rotating  behind  the  respective 
aircraft;  this  ranidly  becomes  unworkable.  The  "gun  spike" 
concept  has  been  used  for  role  dettrminstlon  with  a  range 
payoff,  but  suffers  from  being  c  "borcsight  only"  condition. 

The  popularity  of  a  circular  terminal  condition  is 
partly  due  to  the  relative  ease  of  obtaining  closed  form 
solutions,  and  it  is  largely  for  that  reason  it  is  used  in 
this  study.  The  condition  is  defined  as 

-  y\  ”  l2  -  *  *  0  (2-7) 


r\ 


j 
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r2  -  L2  =  t|>  =  0  (2-6) 

where  (x^,  y^,  z^)  and  (tj,  6f>  Zf)  are  the  final  states  and 
i.  is  the  weapon  range.  This  constraint  has  to  be  satisfied 
for  the  game  to  end;  how  it  ends,  however,  is  determined  by 
the  payoff. 


The  Payoff 

For  this  class  of  game,  the  payoff  (or  cost  function) 
represents  the  objective  of  the  players  throughout  the  game. 

It  is  thus  fundamental  to  question  of  role.  Lynch's  general 
purpose  payoff,  with  A  minimizing  and  B  maximizing  is 

J  -  a  cos  (/)tf‘bsin  (2-9) 

c  cos  eB  fdt 
BtfJ 

where  a,  b,  c  are  constant  weighting  factors  and  0^,  0fi  are 
the  angles  between  the  velocity  vectors  of  A  and  B  and  the 
position  vector  between  them  (Fig.  1,  page  8). 

Changing  the  form  of  Eq  (2-9)  and  rewriting  in  terms  of 
the  polar  co-ordinate  system  (r,  0,  z) ,  the  payoff  becomes 


j  »  !  [a  cos  (zf-0f)  ♦  b  cos  0f]  ♦ 

f 

C  COS  dt 


(2-10) 


The  middle  terms  represents  the  attempt  by  each  player  to  end 
the  game  (be  it  success  or  failuro)  with  his  best  possible  com¬ 
bination  of  angle  off  and  aspect  angle.  The  last  tern  indicates 
that  when  |zf  -  6f|  is  small,  then  time  becomes  a  critical 


G 
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factor.  Increasing  the  importance  of  this  term  is  intended 
to  polarize  the  roles  of  each  player  when  termination  is  close, 
by  the  purser  seeking  to  terminate  quickly,  and  the  evader 
trying  to  delay.  The  payoff  was  investigated  in  Reference  9 
for  two  simple  dynamic  models,  with  a=b=0.  The  emphasis 
in  this  work  is  the  case  were  c=0  (time  is  not  included 
explicitly  in  the  payoff). 

Outcomes  to  the  Game 

There  are  four  possible  outcomes  to  a  combat  (Ref  3:61); 
letting  x  denote  the  state  of  the  game,  these  outcomes  are 


described  by  the  terminal  constraints 

x  e  •*  A  is  successful  and  destroys  B 

X  e  ipB  •*  B  is  successful  and  destroys  A 

x  c  n  <>B  =  il>AB  -»  Both  A  and  B  are  des¬ 

troyed  (mutual  kill) 


£  E  *A  U  <<„  * 


Neither  Anor  B  des¬ 


troyed  (draw) 


(2-11) 

(2-12) 


(2-13) 


(2-14) 


where  ily  and  <>B  are  the  sets  of  states  corresponding  to  suc¬ 
cessful  outcomes  for  A  and  B  respectively.  In  its  basic  forn, 
the  game  terminates  the  first  time  £  enters  either  <>B,  or 
If  it  enters  none  of  these  sets  (i.e.  £  €  i|>D) ,  there  is 
a  draw  (the  result  of  the  game  is  inconclusive). 

For  player  A,  outcome  (2-11)  is  obviously  most  preferable, 
and  outcome  (2-12)  is  least  preferable.  The  preference  order¬ 
ing  of  outcomes  (2-13)  and  (2-14)  is  not  so  obvious  (Ref  3:64). 

Using  the  single  general  purpose  payoff  J  to  represent 
the  outcome  criteria  in  a  zero  sum  game,  the  outcomes  (A 
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minimizing  and  B  maximizing) are  defined  as: 

J  i  ■*  if  E  *1  ■*  success  for  A 
J  >  Jj  «  ij  c  ■*  success  for  B 

JA<  J  <  JB  ■*  ?-f  E  '•'a  u  draw 


where  J  . 

jam 

the  mutual 

unless  J. 

A 

Eqs 


<  J  <  J  and  J_  >  J. 

—  —  max  B  —  A 

kill  outcome  will  not  occur  with  this  payoff 

“  V 

(2-15)  to  (2-17)  enarle  a  game  of  kind  to  be 


formu’ated.  The  problem  of  solution  in  the  large  is  to 
determine  which  of  the  outcomes  result  from  any  set  of 
starting  points  in  the  game  space. 
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III.  Problem  Formul ation 

The  purpose  of  this  chapter  is  to  formulate  a  differ¬ 
ential  game  on  the  basis  of  Chapter  II  and  the  theory  junmar- 
ized  in  Appendix  A.  The  Lynch  payoff  is  cr*d  to  estrblish  a 
game  of  kind,  and  the  influence  of  the  weigh i  factors  in 
the  payoff  is  considered.  The  necessary  cond;'ions  are  ap¬ 
plied,  and  possible  control  combinations  are  derived. 

Payoff  Formulation 

The  Lynch  payoff  of  Eq  (2-10)  can  be  expressed  as 
rtf 

J  *  P  *  Q  Jt o  dt  (3_1) 

where  P  *  (a  cos  (tf-6.)  ♦  b  cos  0f) 

and  Q  »  c  cos  (zf-8f) 

As  time  is  not  usually  the  overriding  factor  in  combat  (apart 
from  total  engagement  limitations),  it  is  not  considered  here 
explicitly  in  the  payoff.  With  c*0,  the  payoff  becomes 

J  •  «  »  -  \  (a  cos  (zf-6f)  -  «  COS  ef)  (3-2) 

where  zf  is  the  angle  off,  and  (zf-8f)  is  the  aspect  angle  of 
B  w.r.t.  A  (Fig.  1,  page  8  ). 

The  relationship  of  the  payoff  to  the  terminal  surface 
of  Eq  (2-8)  is  shown  in  Fig.  2  (page  14).  Linesof  constant 
payoff  (isocosts)  are  drawn  on  a  flattened  terminal  surface 
for  a»b»l.  The  isocosts  enclose  two  arens,  one  around  A's 
best  success  point,  and  the  other  around  B's.  If  limits  are 
set  on  the  payoff,  a  game  of  kind  is  established,  and 
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FIr.  2.,  Isocosts  01.  the  Terminal  Surface  for  a*b* 
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the  isocosts  for  those  limits  enclose  areas  that  represent 
the  allowable  z^  and  8^  combinations  for  a  successful  out¬ 
come  by  A  an  B.  Using  Eqs  (2-1S)  to  (2-17),  in  terras  of 
$,  the  outcomes  (A  minimizing  and  B  maximizing)  are 


0  <_  $  <  -►  success  for  A  (3-3) 

-  *  -  a*b  success  fcr  B  (3-4) 

->  inconclusive  draw  (3-S) 


where  and  dg  may  pe  set  to  reflect  the  upper  limits  of 
A  and  B's  weapon  system  at  terminal  range.  The  neutral  out¬ 
come  4>n  i*  given  by 


♦* 


a*b 

~r 


(3-6) 


and  includes  head-on  and  tail-to-tail  outcomes  as  draws 
(Fig-  2,  page  14). 

The  size  of  the  succesc  areas  can  be  adjusted  by  the 
limits  set  on  the  payoff  ($^  and  $g) ,  and  their  shape  can 
be  adjusted  by  the  weighting  factors  a  and  b.  Increasing  a, 
increases  the  relative  importance  of  (z^  -  6„):  and  similarity 
increasing  b,  increase;  the  relative  importance  of  8f.  The 
effect  this  has  on  the  shape  of  success  areas  is  shown  in 
fig.  3  (page  16).  In  this  way,  the  payoff  can  be  adjusted 
Jo  represent  the  angular  conditions  required  at  termination 
for  a  wide  range  of  weapon  systems.  For  example,  consider 
that  both  A  and  B  require  a  close  approximation  to  boresight 
for  their  particular  weapon  systems,  and  have  to  be  within  a 
certain  aspect  angle  to  the  rear  of  their  opponent.  With  the 
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-40  -30  -20, 


Boresight  line/  X 

-V  A 


20  30  40  ej 


Boresight  line 
for  A 


Fig.  3.  Variation  of  Isocosts  with  Weighting  Factors 
co--i4inate  system  used,  this  implies  A  wou!-d  require  only 
a  small  variation  in  j8j|,  implying  a  large  value  of  b  in 
the  payoff;  whereas  B  would  require  only  a  small  variation 
in  | zf  -  9f I »  implying  *  large  value  of  a  in  the  payoff.  This 
is  shown  in  Fig.  4.  It  is  interesting  that  because  of  the 
use  of  a  .otating  co-ordinate  system,  the  form  of  the  payoff 
for  success  by  A  and  B  is  different,  even  though  their  con¬ 
ditions  for  success  are  the  same.  Within  the ‘limits  of  -Ms 
study,  this  difference,  and  the  implied  lack  of  a  clearly 
defined  neutral  outcome,  is  not  considered  explicitly. 


Amplication  of  Necessary  Conditions 


The  Hamiltonian  is  formed  from  Eqs  (2-1)  to  (2-6)  as 
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6.  Range 

r 

I 

f  Range 

A's  success 
(ai 

A 

\.s>— 


Zf-9f  Rang 


6f  Range 


B's  success 


Fig.  4.  Close  to  Boresight  Terainvt'en  for  A  and  B 

H  *  ha  *  hb  *  Ho  C 


where  HA  •  and  SA  -  Xxy  -  \yx  -  Xt  -  -A6  -  \  ( 

HB  *  SBUB  ,nd  SB  ’  XZ  ( 

H0  “  Va  cos  1  *  *ln  *)  *  *  v. 


■  V#  (Xr  cos  (i  -  0)  ♦  -2.  sin  (i  -  6)), 

xe 

VA(~  sin  0  -  Xr  cos  0) 


Co-states.  The  co-state  equations  are 
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‘  UA  Xy 

(-•-li: 

«-U.  X 

(3-12; 

A  x 

«  V„(X  sin  2  -  X  cos  2) 

B  x  y 

(3-13 

and  in  polar  form 

X„ 

X'  -  --(V.  sin  6  ♦  V„  sin  (2  -  6))  (3-14 

r  1  a.  d 


Xg  -  -Xr(VA  sin  e  ♦  VB  sin  (z  -  0))  - 
X0 

y^(VA  cos  0  -  Vg  cos  (2  -  0)) 

X-  -  X  V  sin  (2  •  8}  •  J  V  cos  (z  -  0) 

Z  To  r  D 

Transversality.  Adjoining  the  constraint  of  Eqs  (2-7) 
and  (2-8)  to  the  payoff  of  Eq  (3-2)  gives  the  transversality 
conditions . 

y2  x 

A  ,  ((a  cos  2-  ♦  b)  -  a  sin  2.)  ♦  vx,  (3-17 

*i  2L3  f  yf  *  1 

x2  y 

A  «  — ,  (—  (a  cos  2,  ♦  b)  -  a  sin  2.)  +  vy,  (3-18 

yf  2L3  xf  * 


(3-15 

(3-16 


£5 


O 


O 
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Also,  as  tine  is  free 

H(tf)  -  H(t)  =  0  (3-23) 

The  unknown  nultiplier  u  can  be-  eliminated  (Appendix  B 
page  77  )  to  give 


Ay  «  [L  sin  0f  (uA  b  sin  0f-uB  a  sin  (zf-6f))  * 

(Vg  cos  zf'vA)  0>  sin  6f  -  a  sin  (zf-6f))]/  (3-24) 


2L(Vg  cos  (tf-6f)  -  VA  cos  6f) 

[2L  Au  cos  0f  -  (b  sin  Bf-  a  sin  (zf-6f))]/ 


‘f  yf 

2L  sin  9 


(3-2S) 


Non-Singular  Controls.  H  Is  linear  in  UA  and  Ug  so 
singular  arcs  are  possible:  for  the  non-singular  case  minA 
■axg  II  gives  the  possible  controls  oA,  ofi  as 

(3-26) 


°A  *  '(Sgn  V  Vax 


°B  *  (sgn  SB)  UBnax 


(3-27) 


Singular  Controls .  Necessary  conditions  for  A  to  have 
a  singular  arc  give 


o 


*  0  «•> 

V 

*  Xz 

and  A.  »  -A- 

l  *3 

(3-28) 

«  0 

A 

•  0 

and  A„  ■  -rA  tan  6 

(3-29) 

y 

6  r 

« 

o 

♦ 

°A 

>  0 

(3-30) 

IS 


a 
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-V,  X  -►  X  >0 
Ax  x  - 


(3-31) 


hence  A  may  have  a  singular  control  of  zero  turn  rate. 
Similarly,  the  application  of  singular  necessary  conditions 


for  B  give 


(3-32) 


<SB>  ’ 
°B 


»  X  tan  z  and  X„  r  X  tan  (z-8) 
y  x  8  r  1  ' 

(3-33) 

B  *  0  (3-34) 

X*  1  0  for  “f  <  zf  <  I 

(3—75) 

X  <  0  for  £  <  z,  <  ^5. 

X  -  2  f  2 


Hence  B  may  al«"  have  a  singular  control  of  zero  turn  rate. 

If  the  conditions  of  Eqs  (3-28)  to  (3-35)  are  all  satisfied, 
then  both  A  and  B  may  be  singular  at  the  same  time.  Satisfying 
these  conditions  requires  that  y(t)  *  0  and  z(t)  »  0,  180*. 
Consequently,  a  direct  tail  chase,  a  head-on  or  a  tail-to- 
tail  are  the  only  situations  where  both  may  be  singular. 

Control  Combinations.  The  non-singular  controls  re¬ 
present  maximum  rates  of  turn  fur  A  and  B  in  either  direction, 
the  singular  controls  represent  straight  dashes.  There  are 
thus  a  total  of  nine  possible  control  combinations  (o^  • 

*  UA-nx'  0  “lth  °B  *  *  U8-.x'  °>* 
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IV.  Closed  Form  Solutions 

Analysis  is  very  much  easier  if  as  much  of  the  game 
as  possible  is  solved  analytically.  This  chapter  presents 
closed  form  results  for  the  states  and  co-states  along 
continuous  control  arcs,  in  terms  of  the  time-to-go,  and 
the  final  states  and  co-states.  To  solve  the  game  fully 
it  is  necessary  to  identify  the  various  surfaces  separating 
these  arcs.  Some  closed  form  expressions  are  derived  for 
these  surfaces.  Finally  these  solutions  are  related  to  the 
terminal  surface  and  payoff.  Details  of  the  derivations 
are  contained  in  Appendix  B. 

States  and  Co-states 

Over  arcs  on  which  the  controls  oA  and  <jg  are  constant, 
tne  state  and  co-state  equations  are  linear  and  can  be  solved 
fairly  easily,  at  least  in  the  cartesian  form  (Appendix  B 
pages  77  to  81  ),  to  give,  for  non-singul*t  arcs  (o^  *  ±  U ^ 

<j  «  ±  U„  ) 

B  Bmax' 

x(t)  *  xf  cos  oAt  ♦  yf  sin  a^t  ♦  Rg  [sin  (oAt-zf)  ♦ 

(4-i) 

sin  ((<JB*®A)  t  ♦  *f)l  -  RA  sin  aA4 
y (t)  »  -xf  sin  oAt  *y  {  cos  oAt  ♦  Rg  [cos  (aAt-zf)  - 
cos  ((°B-°A)  t  ♦  zf)]  ♦  raU-  cos  oAt)  (4-2) 


(I 

I 

I 

l 

i 

( 
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t(t) 

■  <°B  -  0AJ  1  *  *f 

(< 

Ax(t) 

*  A  cos  o  t  ♦  A  sin  o,  t 

*£  A  7£  A 

(4 

Ay  (t) 

«  -A_  sin  o.  t  ♦  A„  cos  o,  t 
xf  A  yf  A 

(4 

Az(t) 

aKt  °nt 

*  Xzf  *  2RB  Sin  "I-  'Xyf  cos  t-f  *  * 

(4 

A  sin  (V  ♦  z  )] 
i  2  1 

where  ■  VA/°A  and  Rg  *  Vg/Og,  the  radii  of  turd  of 
A  and  B  respectively,  and  the  final  tine  at  the  edd  of 
a  given  arc  is  zero,  with  tine  backwards  aloi^  the  arc  being 
negative  (t  <0,  t  <  0,  tf  »  0). 

For  singular  arcs  nodified  solutions  are  generated; 
for  singular  A  (oA  «  0,  Og  «  i  Ug^) 

x(t)  *  *  VA  t  ♦  RB  (sin  pf  ♦  oBt)  -  sin  if)  (4 
y(t)  “  yf  '  RB  (coS  (  °B  1  *  *f)  *  COs  *f)  (4 


(4 

(4 


Ar(t)  «  A  -  2Rg  sin  (-y-  ♦  zf)  sin 

and  for  singular  B  (oA  «  ♦  UA  ,  *  0) 

x(t)  •  xf  cos  cA  t  ♦  (yf  -  Ra)  sin  ^  t  * 
V#  t  cos  (zf  -  oA  t) 


y(t)  *  *xf  sin  <JA  t  ♦  (yf  -  RA)  cos  oA  t  ♦ 
V,  t  sin  (z.  -  o,  t)  ♦  R. 


(4 
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t(t) 

*  *f  -  aA 

t 

(  4  —  1 S ) 

»  A  cos 

xf 

°A  *  ♦  Ay^  sin  oA  t 

(4-16) 

Ay(t) 

« -A  sin 

xf 

°A  *  *  Xyf  cos  °A  t 

(4-17) 

A  (t) 
z 

«  A  »  0 

(4-18) 

Using  Eqs  (4-1)  to  (4-18)  nine  different  arcs  can  be  con¬ 
structed  from  the  nine  possible  combinations  of  control. 

Game  Surfaces 

The  game  space  is  partitioned  by  surfaces  which  re¬ 
present  discontinuities  in  the  controls  and  junctions 
between  the  various  types  of  arcs.  For  a  full  solution  to 
the  game  it  is  necessary  to  identify  these  surfaces,  and  the 
regions  they  partition.  Appendix  A  discusses  some  of  the 
more  important  surfaces,  and  the  conditions  necessary  for 
their  solution. 

Switching  Surfaces .  Switching  of  A's  controls  (other 
than  to  zero)  occurs  when  S^  ■  0  and  S^  i  0.  Similarly 
switching  for  B  (other  than  to  zero)  occurs  when  SR  *  0 
and  Sj  i  0.  Each  of  these  conditions  defines  a  five  dimension 
surface  in  the  (x-X)  space,  which  is  reduced  to  four  dimensions 
by  the  free  time  conditions  of  Eq  (3-23).  For  a  given  arc, 
a  closed  form  solution  can  be  found  that  gives  the  switching 
time  (backward  from  the  final  conditions)  in  terms  of  the 
final  states  and  co-states.  The  switching  tine  is  then  a 
parameter  that  specifies  a  point  in  the  switching  surfc.ee. 


Fig.  S.  Switching  Surfaces  and  Singular  Arcs 
Fig.  5.  shows  a  two  dimensional  illustration  of  a  switching 
surface  for  A. 

A's  Switching  Suface.  For  an  A  switching  surface, 
the  condition  S^  «  0  gives 

Ax  ys  *  Ay  x*  -  Xx  »  0  (4- 


(4-19) 


where  the  states  and  co-states  of  a  junction  point  between 
an  arc  and  the  surface  are  (xs,  Xs) .  Substitution  in  Eq  (4-19) 
of  the  closed  fora  state  and  co-state  solutions  for  an 
arc  gives 


\tlyt  *  RA(1  *  cos  Vs”  *  Xytlxt 


R.  sin  O.t  1 


(4-20) 
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o 


0 


where  ts  is  the  time  (negative)  at  which  the  control  switches 
going  backwards  along  the  arc  (T  <  0,  t£  «  0) e  Hence  Eq 
(4-20)  and  Eqs  (4-1)  to  ( <* - 3 )  are  a  parametric  solution  for 
the  switching  surface.  The  additional  free  time  requirement 
of  H  «  0  in  Eq  (3-23)  gives 


(Xx  ys  '  Xy  Xs)  °B  *  VB  (Xx  C0S  Xs  * 
s  7s  s 

sin  xs)  -  A  VA  =  0 

which,  on  substitution  of  Eqs  (4-1)  to  (4-6)  reduces  to 


(4-21) 


RB[xyf  (v  iin  Vs  ‘  Sl"  *f>  * 

X_  (v  cos  a.t  -  cos  z,)]  -  X,  «  0 

A  S  t  Z  e 


(4-22) 


*£  ns  s  &£ 

where  v  »  VA/VB,  the  speed  ratio  of  A  to  B. 

B ' s  Switching  Suface.  For  a  B  switching  surface,  the 
condition  Sg  «  0  gives 

X  «  0  (4-23) 


o 


Using  the  solution  for  Xz(t)  of  Eq  (4-6),  in  terms  of 
the  final  states  and  co-states  and  the  switching  time,  the 
B  switching  surface  is  given  by 


Xtf  *  RBlXy£  {sln  *f  '  sin  <Vs  *  1i)  * 
Xx^  (cos  tf  -  cos  (Ogt5  ♦  zf)) 


(4-24) 


The  additional  free  time  condition  of  Eq  (3-23)  must  also 
hold  for  the  B  switching  surface  and  gives 


(X*.y*  •  W  °A  *<VXx 


‘s>  + 


X  sin  x  )  -  X  v. 
ys  *  xs  * 


(4-2S) 
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or  in  terms  of  the  final  conditions  and  t 

s 


-X.A  *  YfXr 

f  yf  f  Xf 

♦  V*  + 

X]  - 

R.X 

A  xf 

>  0 

(4-26) 

nhclc  A 

*  A  (cos  Z,  + 

Xf  f 

£«r  -  i) 

COS 

(Vs  * 

tf)) 

Y 

«  A  (sin  zf  ♦ 

yf  1 

(or  -  1) 

sin 

(•Vs  + 

tf)) 

or  or 

■  Cg/oA,  the  turn  ratio 

of  B 

to  A 

Singular  Surfaces .  Singular  surfaces  are  reduced 
forms  of  switching  surfaces;  in  addition  to  the  switching 
functions  being  zero,  at  a  junction  with  a  singular  surface 
it  is  also  necessary  that  the  conditions  of  Eqs  (3-29)  to 
(3-31)  are  satisfied  for  A  (Eqs  (3-33)  to  (3-3S)  for  B) . 

Arcs  that  satisfy  these  conditions  and  switch  to  singular 
controls  at  a  point  on  the  singular  surface  will  remain  on 
it  (going  backwards  in  time),  until  deliberately  forced  off 
(Fig.  5,  page  24).  Conversely,  going  forward,  an  infinite 
number  of  trajectories  join  the  singular  arc.  The  additional 
condition  of  H  >  0  reduces  the  dimension  of  the  surface  to 
three,  and  thus  a  surface  in  the  state  space. 

A's  Singular  Surface.  For  an  A  singular  surface, 

Eq  (3-29)  gives 

X  ■  0  (4-27) 

7s 

Using  the  solution  for  Xy(t)  given  in  Eq  (4-5),  in  terms 
of  the  final  conditions  and  the  switching  time  the  singular 
condition  becomes 


**f  5in  Vs 


xrt  cos  V-  '  0 


(4-28) 


The  free  time  requirement  still  exists;  substituting 
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Eqs  (3-28)  to  (3-30)  into  II  =>  0  gives 


sp-  ♦  cos  2  -  v  «  0 

Kg  S 


(4-29) 


This  is  the  equation  in  state  space  of  the  A  singular 
surface.  Arcs  that  intersect  this  surface  and  satisfy  the 
singular  switching  conditions  of  Eq  (3-28)  to  Eq  (3-31) 
switch  to  singular  A  control,  and  remain  on  the  surface, 
with  their  trajectories  given  by  Eqs  (4-7)  to  (4-12). 

In  cerms  of  the  final  conditions  and  switching  time,  Eq 
(4-29)  becomes 

*'W  ■  xf  sin  Vs  *  yf  cos  Vs  + 

RB  cos  *  *f)  ♦  RA  sin  (1  -  cos  a^ts)  *  0  (4-30) 

B 1 s  Singular  Surface.  For  a  B  singular  surface, 

Eqs  (3-32)  and  (3-33)  give 


0 


and  A  sin  z  -  X  cos  2  »  0 

xs  *  ys  s 

which  in  terms  of  the  final  conditions  and  t$  become 


and  sin  (o„t  ♦ 
X «  os 


V  - 


(Vs 


V 


(4-31) 

(4-32) 


(4-33) 

(4-34) 


As  for  A,  the  free  time  requirement  must  be  satisfied,  and 
substituting  Eqs  (3-32)  to  (3-34)  into  H  »  0  gives  a  B  sing¬ 
ular  surface  in  state  space  as 

(y^  cos  zs  -  sin  *S)RA  *  (1/v)  -  cos  zs*  0  (4-3S) 

In  terns  of  the  final  conditions  and  t  ,  the  condition  is 

% 


© 


o 


V0A  •  lx£  sin  •  (yf- 

Ra)  cos  iC8ts»zf)  +  Rg  (1  -  cos  0Bts)]  =  C 


(4-36) 


In  Chapter  III  page  20  it  was  shown  that  the  only 
possible  conditions  for  both  A  and  B  to  be  singular  at  the 

sane  tine  were  v(t)  *  0  and  z(t)  =  0.-  Substitution  of  this 

into  the  equations  for  the  singular  surfaces  (Eqs  4-2»")  and 
(4-3S) ,  with  oA  »  0  and  Og  =  0,  gives  v  ■  1.  This  implies 
that  unless  the  speeds  of  A  and  B  are  the  sane,  the  joint 
singular  case  does  not  exist,  and  a  direct  tail  chase,  head- 
on,  or  tail-to-tail  will  not  occur,  other  than  instantan¬ 
eously. 

Dispersal  Su  -faces.  From  Appendix  A,  the  major  re¬ 
quirements  of  points  on  a  dispersal  surface  are  that  two  or 
more  paths,  with  differing  control  combinations  on  each, 
intersect,  and  that  the  payoff  at  termination  is  the  same 

for  each  path.  For  two  paths  this  condition  can  be  expressed 


where  x  ,  and  x  ore  the  state  vectors  of  each  path  at 
1  ^2 

the  dispersal  surface,  and  and  are  their  respective 
terminal  payoffs.  If  paths  travel  directly  fron  the  dis¬ 
persal  surface  to  the  terminal  surface  without  switching 
controls,  as  shown  in  Fig.  6,  then  using  the  closed  form 
solutions  for  states  and  co-states  over  each  constant  control 
arc,  the  dispersal  surface  can  be  expressed  in  terms  of  the 


i 
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where  t.  and  t,  are  the  tines  of  intersection  alo->g  paths 
1  and  2.  Equations  (4-39)  to  (4-43),  together  with  the  free 
tine  condition  for  each  path,  the  terminal  constraints,  and 
the  transversality  conditions,  fully  define  a  dispersal  sur¬ 
face.  A  more  condensed  form  was  not  found;  however,  the 
above  was  used  as  the  basis  for  a  numerical  search  for  the 
dispersal  surface. 


Solutions  at  the  Terminal  Surface 

To  complete  the  solution,  trajectories  and  surfaces 
in  the  game  space  are  analyzed  backwards  from  the  terminal 
surface;  the  first  step  is  to  find  the  controls  at  the  TS, 
and  any  intersections  with  other  surfaces. 

Controls.  For  the  controls  at  the  TS,  the  condition 


of  Eq  (4-19)  gives 


SA  *  Xxf*f  "  Xyfxf 


(4-44) 


S  •  A 
B  z. 


(4-45) 


Substituting  in  the  transversality  conditions  of  Eqs 
(3-21)  and  (3-22)  for  the  co»states,  results  in 


SA  »  -b/2  sin  6f 
SB  ’  I  5in  (lf  '  V 


(4-46) 


(4-47) 


Using  the  conditions  of  Eqs  (3-26)  and  (3-27)  the  nonsingular 


controls  on  the  TS  are 


UAMAX  f°r  0  <  Pf  <  * 
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°Af  =  UAMAX  for  -”  <  9f  <  0 
°Af  *  U6MAX  f0r  0  4  <VV  <  * 
°Bf  "  UBMAX  for  -1'  <  ‘VV  <  0 


(4-48) 


This  control  distribution  is  shown  in  Fig.  7  (page  32'. 

As  Lynch  points  out  (Ref  9:191)  the  controls  are  the  exact 
opposite  of  those  produced  in  the  classical  barrier  problem. 
In  this  case,  with  the  generalized  payoff,  A  is  turning  into 
line  of  sight  A  to  B,  and  B  is  turning  into  line  of  sight 
B  to  A.  lienee  using  this  payoff,  the  terminal  controls  are 
aggressive.  In  a  defensive  position,  each  aircraft  continues 
turning  when  at  the  opponents  kill  range,  in  an  attempt  to 
increase  angle  off  and  aspect  angle  and  maneuver  into  an 
attacking  position. 

TS  Switching  Surfaces .  The  points  on  the  TS  where 
the  switching  functions  S  and  S  are  zero  are  intersection 

A  D 

points  of  the  TS  with  switching  surfaces.  The  lines  of 
intersection  thus  formed  are  shown  in  Fig.  7  and  are  given 
by 

S^  »  0  ♦  A  switching  line  of  •  o,  a;  for  -it  <  zf  <  a 
Sj  *  0  ♦  B  switching  line  of  (zf-9^)  ■  0,  it;  for  (4-49) 
-*  <  0f  <  a 

If  S’  *  0  or  Sg  «  0,  the  possibility  exists  for  singular 
arcs  to  terminate  on  those  switching  lines.  For  A,  the 
singular  condition  of  Eq  (3-29)  with  6^  *  0,  it  gives 


s:  -  -v.x.  /L 
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Fig.  7.  Controls  at  the  Terminal  Surface 
Substituting  in  the  terminal  conditions  of  Eq  (1-211 


results  in 


•V*  sin  *f 


Using  this,  A's  controls  along  the  switching  lines  >  0, 


0A(tf-)  -  -UAHAX  for  0  <  :{  <  t 

oA(tf->  *  *UAMAX  f0r  -11  <  *f  <  0  (4'52 

0A(tp  ■  0  for  *-  -  0,  a 

Hence  the  only  co-ordinates  at  which  an  A  singular  arc  can 
terminate  at  the  TS  are  (0,  0),  (0,  a),  (a,  a)  (a,  0).  For 
>,  the  singular  condition  of  Eq  (3-33)  with  (z.  -  0.)  •  0,  a 


a 
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S’  .  — ~  (4-S3) 

Substituting  in  the  terminal  conditions  of  Oq  (3-21)  results 
in  -bVg  sin  6f 


and  B's  controls  along  the  switching  lines  z^  -  0f 


(4-S4) 

0, 


oB(tf-) 

*  +UBMAX  °  <  »{<  * 

0B(tf-) 

*  -UBMAX  for  -it  <  0f  <  0 

(4-55) 

w 

«  0  for  0^  »  0,  it 

Hence  the  only  co-ordinates  on  the  TS  at  which  a  B  singular 
arc  could  terminate  are  identical  to  those  for  an  A  sing¬ 
ular  arc.  This  implies  that  singular  arcs  only  terminate 
at  the  TS  when  both  A  and  B  are  singular.  The  further  re¬ 
quirement  of  equal  speed  (page  28  )  virtually  eliminates  the 
possibility  of  singular  arcs  at  the  TS. 

TS  Singular  Surfaces.  The  A  and  B  singular  surfaces  of 
Eqs  (4-29)  and  (4-35)  may  intersect  with  the  TS  for  certain 
values  of  the  game  parameters.  For  A,  substitution  of  the 
terminal  constraint  into  Hq  (4-29)  gives 


L  sin  6 

~ V 


f  ♦  cos  Zf  -  V 


(4-S6) 


for  the  intersection  line.  The  position  of  the  line  will 
vary  with  L,  Rg,  and  v  as  shown  in  Fig.  8  (page  34  ).  This 


33 


Fig.  8.  Projection  of  A  Singular  Surfaces  in  xy  plane 

is  because  the  singular  surface  given  by  Eqs  (4-29)  and 
(4-35)  are  necessary,  but  not  sufficient,  for  singularity. 

In  this  case,  a  terminal  cost  gives  transversality  conditions 
which  must  also  be  satisfied  for  a  singular  arc.  This 
reduces  the  A  singular  surface  to  a  curve  in  the  state  space 
which  is  very  unlikely  to  intersect  the  TS.  A  similar  argu¬ 
ment  applies  for  the  B  singular  surface. 

TS  Dispersal  Surfaces .  The  switching  lines  for  B  given 
by  Eq  (4-48)  also  satisfy  the  conditions  for  a  dispersal 
surface  (in  a  limiting  sense)  given  by  Eqs  (4-39)  to  (4-43). 
Thus  the  lines  x^  -  8^  ■  0,  a  are  the  intersection  of  a  B 

dispersal  surface,  because  only  B  has  a  choice  of  controls. 
Similarly,  the  only  positions  on  the  TS  at  which  A  has  a 
dispersal  condition  are  (0,  0),  (0,  a) ,  (a,  0)  and  (a,  a). 
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These  co-ordinates  also  lie  on  the  B  dispersal  line  and  thus 
represent  points  of  "double  choice"  or  "mixed  strategy" 
between  A  and  B. 
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V.  Analysis  of  the  Solution  in  the  Large 

To  complete  the  full  solution  of  the  game,  it  is 
necessary  to  locate  the  game  surfaces,  and  analyze  the  way 
in  which  they  partition  the  game  space.  The  method  consists 
of  constructing  trajectories  and  surfaces  backwards  from 
the  terminal  surface.  The  application  of  this  method  using 
the  closed  form  solutions  is  presented  in  this  chapter. 

Two  techniques  are  used  in  the  analysis: 

1.  Surface  Solutions  in  Closed  Form.  The  closed 
form  solutions  for  surfaces,  though  mostly  transcendental 
in  the  switching  time,  can  be  solved  numerically.  How¬ 
ever,  the  solution  does  become  very  complex  if  the 
surfaces  are  connected  to  the  terminal  surface  by 
trajectories  consisting  of  a  number  of  arcs. 

2.  Trajectory  Analysis .  Trajectories  are  constructed 
backwards  from  isocosts  on  the  TS.  The  conditions  of 
Chapter  III  pages  IS  to  2C  are  used  to  switch  controls, 
thus  generating  trajectories  composed  of  a  number  of 
arcs.  The  global  optimality  of  these  trajectories  is 
uncertain  until  surfaces  can  be  isolated  by  parameter¬ 
ization  and  projection  of  a  large  number  of  trajectories 
(Refs  10.  11). 

The  advantage  of  the  first  technique  is  that  it  gives  the 
surfaces  directly,  though  vith  difficulty  for  surfaces  not 
close  to  TS.  The  second  method  has  the  advantage  that  tra¬ 
jectories  can  be  plotted  to  allow  visual  interpretation; 
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though  this  is  very  long  and  tedious,  and  of  doubtful  ad¬ 
vantage  for  problems  modeled  with  more  than  three  dimensions 

General  Trajectory  Patterns 

In  order  to  select  some  typical  parameters  for  more 
detailed  analysis,  the  second  technique  was  used  to  indicate 
the  general  pattern  of  trajectories  for  various  game  para¬ 
meters.  The  following  main  observations  result: 

1.  Singular  arcs  occur  and  do  not  intersect  the  TS, 
thus  confirming  the  results  of  Chap  IV  page  33  .• 

2.  Some  trajectories  appear  to  contradict  actual 
experience;  for  example,  as  shown  in  Fig.  9  (page  38). 
as  B  overshoots  he  is  turning  away  from  A,  rather 
than  inwards  as  might  be  expected. 

3.  As  shown  in  Figs.  11  and  12  (pages  40  and  41  ), 
many  successful  terminations  result  from  trajectories 
that  correspond  to  a  scissors  maneuver  (Ref  7:62). 

4.  To  be  as  realistic  as  possible,  and  comparable 
with  other  work,  specific  parameters  selected  for  more 
detailed  analysis  are 

VA  *  800  ft/sec  *  0.2  Tads/sec 

giving 

»  4000  feet 

and 

Vj  ■  720  ft/sec  U^1Ax  *  0.24  rads/sec 

giving 


Rg  •  3000  feet 


A  Singular  Arc  (8.-2.007*) 


Trajectori 


Trajectories  Ending 


Scissors  Trajectory  Ending  in  a  Load  for 


Thus  the  velocity  ratio  v  =  V./V-  =  1.1,  and  the 

A  t> 

control  ratio  ubhax^UA'1AX  *  1,2>  Tile  weapon  range  is 

L  «  2000  feet,  and  each  aircraft  is  assumed  to  need 

boresight  to  within  approximately  ♦_  10*  at  an  angle 

off  of  within  approximately  40*;  this  gives 

$  »  0.02  a  *  0.2  b  *  1.8 

A 

defining  A's  success,  and 

<bB  «  1.98  a  =  1.8  b  *  0.2 
defining  B's  success. 

The  general  pattern  of  trajectories  using  these  game 
parameters  is  shown  in  Figs.  9,  10,  11,  12  and  IS.  All  of 
these  trajectories  terminate  on  the  upper  half  ((zf  -  0 1-)  >  0,  it) 
of  an  isocost,  and  because  of  inverse  symmetry  about  zf  -  0f» 

0,  a,  they  are  sufficient  to  show  the  behavior  of  trajectories 
terminating  on  the  lower  half.  A  lag  ending  for  A  is  con¬ 
sidered  as  if  >  !f  >  0  or  Zj  <  6^  <  0.  A  lead  ending  for  A 
is  considered  as  zj  <  8j  >  0  or  zf  *  ®f  *  0.  Note  also  that 
0f  «  0  is  the  boresight  condition  for  A  (Fig.  1,  page  8  ). 

Application  of  Surface  Solutions 

Analysis  of  the  trajectories  in  Fig.  9,  shows  that 
the  A  singular  arc  is  joined  to  the  TS  by  one  further  arc. 
Consequently,  the  closed  form  surface  technique  is  applied 
to  locate  the  exact  point  at  which  the  singular  arc  ends, 
and  the  point  on  the  TS  at  which  the  singular  trajectories 
terminate.  This  is  repeated  for  a  number  of  costs,  producing 
curves  that  represent  the  end  of  singular  arcs  in  the  game 
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space  (Fig.  14,  page  44  ),  and  the  ternination  of  singular 
trajectories  on  the  TS  (Fig.  IS,  page  46).  Curves  are 
generated  for  vcrious  weighting  factors. 

End  of  Singular  Arcs .  Fig.  14  shows  that  if  A’s  success 
is  strongly  oriented  to  boresight  (small  "a"  and  low  0^.  range), 
then  A  turns  out  of  his  straight  dash  into  B  earlier  (and  farther 
away)  than  he  would  for  termination  requirements  of  a  small 
aspect  angle  well  off-boresight  (large  "a"  and  low  (tf  -  0f) 
range).  Also,  as  the  cost  decreases  to  tero,  there  appears 
to  be  a  lower  limit  on  y.  This  is  found  by  using  Eq  (4-29) 
for  an  A  singular  surface.  All  the  singular  curves  must  lie 
on  this  surface,  so  that  for  a  „  >  0 

D 

ymin  *  (  v-  1)  Rb  and  a  «  0  (S-l) 

For  the  parameters  used,  ynin  is  <00  feet  and  this  agrees 
closely  with  Fig.  14.  This  indicates  that  B  has  to  be  at 
least  that  distance  off  to  one  side  for  A  to  go  singular. 

Ternination  of  Singular  Trajectories .  The  general 
pattern  of  trajectories  in  Fig.  9,  (page  38  )  suggests  that 
a  large  number  of  trajectories  result  in  a  singular  arc, 
which  itself  terminates,  via  one  more  arc,  at  one  point  on 
the  TS.  Thus  the  terminal  curves  of  Fig.  IS  are  lines  on 
the  TS  where  termination  occurs  from  a  large  number  of  starting 
positions.  The  lines  extend  over  a  very  much  lower  range 
of  0j  than  the  total  range  of  6^  possible  for  a  given  cost. 

For  example,  if  a  ■  0.2  and  $  *  0.02,  then  from  Eq  (3-3) 

|  6j  i  <_  12.6*,  whereas  Fig.  IS  shows  that  the  singular 
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trajectory  terminates  at  3^  £  2.0*.  However  proximity  to 
boresight  is  gained  by  A  at  a  cost  in  angle  off  (z^)  which 
is  high;  indeed  for  low  values  of  "a",  the  singular  curve 
gives  values  of  z^  close  fo  the  maximum  for  a  given  isocost. 
Thus  Fig.  15  can  be  used  to  select  weighting  factors  and  costs 
to  represent  the  angular  requirements,  at  a  given  range,  of 
differing  weapon  systems.  The  drawback  in  using  these  tra¬ 
jectories  is  that  they  may  be  globally  non-optimal  for  the 
solution  in  the  large;  this  requires  further  consideration  of 
the  game,  and  is  dealt  with  in  Chapter  VI. 

Application  of  Trajectory  Analysis 

Before  attempting  to  parameterize,  trajectory  projec¬ 
tions  are  analyzed  for  the  existence,  and  general  position 
of  switching  surfaces,  singular  arcs,  and  dispersal  surfaces. 
As  the  scissors  trajectories  illustrate,  points  on  switching 
surfaces  exist  in  large  numbers,  and  considerably  complicate 
trajectory  analysis. 

II  Singular  Arc.  As  shown  on  page  45  a  singular  arc 
for  A  (and  its  symmetrical  counterpart)  is  found  for  tra¬ 
jectories  that  terminate  with  some  lag  for  A  ((zf  -  6j)  >  0) 

A  singular  arc  was  also  found  for  B,  as  shown  in  Fig.  10, 
page  39  }.  However  as  this  arc  is  connected  to  the  TS  via 
two  other  arcs  ({oA  <  0,  oB  >  0) ,  (a^  >  0,  >  0)),  a  B 

singular  closed  form  solution  has  not  been  attempted.  An 
interesting  observation  is  that  the  arc  is  degenerate  in  the 
sense  that  nonsingular  arcs  joining  it  on  one  side  are  almost 
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parallel  in  the  xy  projection  (Fig.  10).  The  B  singular 
arc  thus  resembles  what  Isaacs  calls  a  semi-universal  surface 
(Ref  8:196).  This  can  be  shown  analytically  by  considering 
a  singular  and  a  nonsingular  arc  backwards  from  a  point  JC g 


(tf  «  0)  on  the  singular  arc.  The  singular  arc  state  solutions 
given  by  Eqs  f4-7)  to  (4-9)  are  subtracted  from  the  nonsingular 
arc  state  solutions  of  Eqs  (4-1)  to  (4-3)  giving  the  following 
differences  in  states  at  a  time  t  backwards  along  each  arc 
from  Xg 

oRt  °nt 

Ax(t)  -  2Rfi  sin  cos  (-|—  (tg  -  cAt))  -  Ra  sin  oAt  - 

(5-2) 

V  C.0s  (ZB  '  °At) 

Ogt  O  t 

Ay(t)  »  2Rg  sin  — j—  sin  (— j—  ♦  (tg  -  <7^)t)  * 

(5-3) 

V  Sln  (zB  ‘  V5 


Az(t)  ■  Ogt  (5-4) 

The  possible  nonsingular-arcs  are  og  »  ±  0.24  and  a A  »-0.2; 
so  that  for  t  <  3  secs,  Ogt/2  is  small  and  Eqs  (5-2)  and 
(5-3)  can  be  approximated  by 

Ax(t)  %  (RgOgt  -  VBt)  cos  (tB  -  aAt)  -  Ra  sin  aAt  fS-S) 

*y(t)  $  (RgOgt  -  V3  sin  (ZB  '  V3  (5-6) 

Substituting  >  0  gives 

Ax(t)  £  -ra  sin  “here  aA  <  0  (5-7) 

4y(t)  $  0  (5-8) 
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Thus  for  low  values  of  t,  there  is  only  a  small  difference 
in  the  x  and  y  co-ordinates  between  the  B  singular  arc  and  the 
nonsingular  arc  joining  it  with  positive  B  control.  This 
confirms  the  trajectory  plots  of  Fig.  10.  However,  as  for 
the  A  singular  case,  there  still  remains  the  question  of 
global  optimality  for  these  trajectories. 

Dispersal  Surfaces .  Direct  closed  form  solutions 
for  dispersal  surfaces  are  not  cvailable,  but  analysis  of 
trajectory  projections  from  the  limiting  isocost  indicates 
some  possible  points  on  a  dispersal  curve.  The  locations 
of  some  DS  points  are  shown  in  Fig.  9  and  13;  these  are 
special  DS  points  representing  a  choice  of  control  for  /oth 
A  and  B.  Considering  the  points  in  two  groups: 

1.  Aj  and  Bj.  The  points  Aj  and  B ^  are  similar  be¬ 
cause  they  probably  both  lie  on  the  closest  DS  to  the 
terminal  surface,  and  are  thus  likely  to  lie  on  the 
bounds  of  a  closed  region  of  the  game  space  (P.ef  6:31). 
(For  a  two  dimensional  problem  this  is  definitely  so; 
for  a  three  dimensional  problem,  closure  does  not 
necessarily  occur).  Inside  these  regions  (for  point 
Bj)  success  for  A  is  inevitable  for  optimal  play, 
regardless  of  B's  actions.  Thus  point  B^  appears  to 
be  the  closest  that  B  can  reasonably  allow  A  to  ap¬ 
proach,  in  a  direct  tail  chase  situation,  before  turning. 
If  B  delays,  then  A's  success  is  inevitable,  and  B's 
condition  at  capture  becomes  worse  the  longer  he  waits 
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to  turn.  A  similar  situation  occurs  at  A^  for  A  with 
respect  to  B's  final  success. 

Aj  and  B^.  With  the  A^ ,  B,  dispersal  points,  it  is  likely 
that  they  do  not  lie  in  the  first  DS  that  the  generating 
trajectories  intersect.  So  for  a  straightforward  game 
of  pursuit  and  evasion  with  no  role  change,  these  points 
are  likely  to  be  in  part  of  the  game  space  which  results 
in  a  draw  in  optimal  play.. 

The  location  of  these  points  and  the  DS  points  on  the  7S 
(page  34  )  will  enable  dispersal  curves  to  be  located  for  a 
gi' en  isocost;  either  by  parameterization  of  many  trajectories, 
or  by  using  a  numerical  search  technique,  based  on  the  DS 
conditions  on  page  29  ,  to  extrapolate  from  a  known  DS  point. 

Summary 

In  this  chanter,  the  closed  form  solutions  are  applied 
in  an  attempt  to  solve  the  problem  completely.  Two  techniques 
are  tried:  direct  application  of  the  closed  form  surface 
solutions,  and  analysis  of  trajectories.  Some  general  tra¬ 
jectory  characteristics  are  observed,  in  particular,  the 
correspondence  of  many  trajectories  to  a  sciscors  maneuver. 
Precise  location  of  A  singular  arcs  enables  some  criteria 
to  be  set  for  selecting  the  payoff  parameters  to  match  various 
weapon  capabilities.  A  singular  B  arc  is  found,  and  an  inter¬ 
esting  approximation  to  a  semi-universal  surface  is  confirmed. 
Finally^ some  points  on  dispersal  surfaces  are  located,  and  two 
methods  for  locating  the  surfaces  are  suggested. 
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Success  in  fully  defining  the  game  space  is  limited, 
largely  by  time.-  It  does  however  make  a  start  and  lead  the 
way  to  a  complete  solution  of  the  game. 
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VI.  The  Role  Problem 

Until  the  game  is  completely  solved,  analysis  of  the 
role  problem  is  limited.  However,  in  this  chapter,  a  method 
of  role  determination  is  discussed.  Capture  regions  can  be 
determined  for  the  game,  and  a  method  is  proposed  of  estim¬ 
ating  regions  of  advantage  which  assist  in  role  determination 
outside  capture  regions.  The  results  of  Chapter  V  are  related 
to  this  concept,  and  finally  the  relationship  with  the  clas¬ 
sical  game  is  discussed. 

Role  Determination 

The  role  determination  problem  is  considered  to  consist 
of  two  parts: 

1.  Which  player  has  an  advantage,  and  to  what  extent? 

2.  Based  on  the  relative  advantage,  what  is  now  the 
objective  (the  role  decision)  of  each  player? 

Both  parts  are  inter-related  because  the  degree  of  advantage 
obviously  depends  on  the  objectives.  From  the  pilots  point 
of  view,  the  questions  of  role  are: 

1.  How  good  or  bad  is  my  situation? 

2.  What  should  be  my  goal? 

For  example,  if  the  situation  is  good,  then  should  the  objective 
be  to  continue,  or  to  escape  while  that  is  possible?  This 
decision  is  easier  if  the  amount  of  advantage  is  high  or  low. 

For  example,  if  one  pilot  is  about  to  shoot  down  an  opponent, 
he  is  unlikely  to  flee,  and  his  opponent  has  little  choice  but 
to  vigorously  avoid  defeat.  However,  if  neither  has  a  distinct 
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advantage,  then  each  nay  remain,  or  may  be  able  to  escape 
the  combat  completely. 

The  choice  of  objectives  in  the  role  decision  problem 
also  depends  on  the  strategy  needed  to  achieve  it.  For 
example,  it  may  be  necessary  to  risk  a  close  shave  with  an 
opponent  in  order  to  reach  a  position  of  advantage.  The 
problem  of  role  is  thus  very  complex, and  casts  doubts  on  the 
validity  of  a  zero-sum  game.  However,  a  zero  sum  game  is 
easier  to  handle,  and  gives  some  solutions  to  the  role  problem. 

In  principle,  for  the  game  as  formulated,  if  the  initial 
states  are  known,  the  outcome  is  also  known  (assuming  optimal 
play  by  both  sides).  Thus  the  game  space  (G)  can  be  divided 
into  four  distinct  regions  resulting  in  the  four  possible 
outcomes  (page  11)..  These  regions  are:  RSA  (success  for  A), 

RSB  (success  for  !>) ,  RgAB  (mutual  kill),  and  R  (draw). 

Consider  a  problem,  such  as  this  one  with  the  Lynch  pay¬ 
off,  in  which  a  mutual  kill,  R  ,  is  effectively  nonexistent: 

b  Ac 

if  the  states  are  in  RSA,  and  A  plays  optimally,  then  the 
states  remain  in  RjA  for  the  duration  of  the  game,  and  term¬ 
ination  occurs  successfully  for  A  ($  <  *A  in  Fig.  16,  page  54  ). 
A  similar  argument  applies  to  B.  In  the  other  region  R,  both 
players  must  play  optimally  for  the  states  to  remain  in  this 
region  and  a  draw  outcome.  If  one  player  does  not  play  optim¬ 
ally,  then  the  states  can  move  closer  to  a  region  more  favor¬ 
able  to  his  opponent.  The  boundaries  between  the  regions 
constitute  "barriers"  in  the  sense  that  they  are  not  crossed 
in  optimal  play.  (Appendix  A,  page  74)- 
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Fig.  16  A  2-D  Representation  of  Regions  of  Success, 
and  Regions  of  Advantage  Based  on  Optimal  Play. 

A  method  of  role  determination  now  exists  for  regions 
RgA  and  Rgg.  If  x  e  Rg^,  this  implies  A  has  an  almost  total 
advantage  and  should  press  home  his  attach  (i.e.  a  definite 
pursuer);  conversely,  B  has  little  hope  of  avoiding  A  and 
should  concentrate  entirely  on  evasion  (i.e.  a'  definite  evader) 
A  similar  argument  applies  for  x  e  Rgg. 

.  The  situation  in  the  region  R,  however,  is  not  so  clear. 
Neither  has  a  decisive  advantage,  nor  is  under  an  immediate 
threat.  Each  player  presumably  wants  to  maneuver  his  opponent 
into  his  own  success  region,  though  conceivably  each  nay  wish 
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to  escape  while  the  going  is  good.-  The  division  of  G 
may  now  be  represented  by 


X 

c 

SSA  - 

role 

determineu/A's 

success 

(capture) 

region 

(6-1) 

X 

c 

rsb  > 

rol  e 

deternined/B ' s 

success 

(capture) 

region 

(6-2) 

X 

c 

R 

role 

uncertain/draw 

occurs 

for  optimal 

play 

(6-3) 

where  G  *  R  ♦  R$A  *  RSB 

RgA  and  Rg  are  capture  regions  resulting  from  this  zero 
sum  pursuit-evasion  game  in  which  A  is  the  pursuer,  B  the 
evader,  and  vice-versa.  Thus  this  game  formulation  is  useful 
in  defining  regions  of  extreme  advantage  and  disadvantage 
(indeed  complete  success  under  optimal  clay  by  the  pursuer). 
These  regions  may  be  very  small  if  the  combatants  have  a 
similar  performance  (Ref  6:32),  and  consequently,  it  is  im¬ 
portant  to  extend  the  role  problem  into  the  region  R.  The 
apparent  choice  of  objectives  in  R  suggests  a  non-zero  sun 
game;  however,  the  problem  here  is  whether  the  zero  sum  game 
and  payoff,  as  formulated,  can  be  used  in  this  region  to  de¬ 
termine  role. 


Regions  of  Advantage 

The  first  mart  of  the  role  problem  in  R  is  to  determine 
regions  of  advantage. 

One  method  of  producing  these  regions  is  to  expand 
the  solutions  outwards  from  the  successful  limits,  by  changing 
the  terminal  costs  to  0!  and  $’  (0!  >  6.  and  iL  <  6„).  As 

A  3  A  A  H  D 

shown  ip  rig.  16  (page  Si), this  would  result  in  expanding  around 
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the  success  regions  R„,  and  R„„.  These  expanded  regions 
SA  5  d 

indicate  states  which  would  result  in  a  draw  under  optimal 

play  for  the  game  as  formulated.  The  term  draw  is  misleading, 

since  it  may  include  a  wide  range  of  terminal  states:  from 

a  close  shave  ($  -v.  $.),  to  a  head  on  (<>  =  neutral  outcome, 

^  A 

<>N) ..  However,  if  is  considered  a  good  '’raw  outcome 

for  A,  and  <  $  <  dg  is  considered  a  good  draw  outcome  for 
E,  then  the  regions  that  result  in  these  outcomes  may  be  con¬ 
sidered  as  regions  of  advantage. 

The  drawback  with  this  method  is  that  on  indecisive 
outcome  results,  and  the  game  continues  indefinitely.  Also, 
the  assumption  that  a  draw  outcome  close  to  A's  success 
area  is  good  for  A  and  places  him  at  an  advantage,  is  not 
necessarily  true.  Indeed,  it  may  be  that  if  A  just  misses 
B,  then  A  is  at  a  grave  disadvantage  (for  example,  on  an 
overshoot).  This  method  is  useful  however,  because  the  regions 
of  advantage  generated  represent  the  extent  to  which  an 
opponent  could  avoid  a  kill,  should  he  wish  to,  by  playing 
optimally. 

Another  method  avoids  the  draw  outcome,  but  uses  tra¬ 
jectories  which  are  non-optimal  in  the  strict  sense  of  the 
game  as  formulated. 

Consider  that  in  a  region  of  advantage  to  A,  his  objective 
is  to  minimize  $,  and  B's  objective  is  to  allow  A  to  cone 
very  close  to  success,  in  the  hope  that  by  doing  so,  B  will 
move  Into  his  own  region  of  advantage.  In  other  words  B  is 
deliberately  allowing  himself  to  be  dtawn  close  to  A's 
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capture  regions,  R_A  (this  may  also  occur  if  B  wants  a  draw, 
but  keeps  making  mistakes).  This  strategy  is  non-optimal 
in  the  game  as  formulated;  indeed  B  is  virtually  playing  a 
non-tero  game  by  co-operating  (at  least  for  a  time)  with  A's 
objective.  Consequently,  trajectories  resulting  in  RgA  (or 
extending  from  P.  in  a  backwards  solution)  are  non-optimal 
for  B;  nevertheless,  they  do  represent  almost  the  best  that 
A  can  achieve,  with  B  co-operating  or  making  mistakes.  Thus 
these  trajectories  may  be  used  to  indicate  general  regions  of 
advantage  to  A.  Because  player  B  is,  at  least  temporarily, 
helping  his  opponent,  the  resulting  regions  of  advantage  are 
expected  to  be  large. 

A  region  of  advantage  for  A  may  now  be  defined  by  these 
starting  points  in  R  for  which  A  is  successful,  if  B,  either 
cannot  avoid  A's  successful  outcome,  or  is  using  a  strategy 
which  would  result  in  A's  success  (conversely  for  B) .  This 
gives 

x  e  Raa  iff  B  plays  to  $  <  (6-4 ) 

x  e  Rab  iff  A  plays  to  6  >  «B  (6-5) 

where  R  is  A's  region  of  advantage,  and  R  is  B's 
A  A  Ad 

region  of  advantage.  The  possttility  also  exists  that  the 
regions  of  advantage  may  intersect  (RAA  0  RAg  f  0). 

The  first  part  of  the  role  determination  in  R  problem 

is  now 

x  E  R^a  ♦  A's  advantage  (6-6) 

-  €  RAB  *  »’*  advantage  (6-7) 

—  c  <RAA  ^  RAB>  t0  both  (6-8) 
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Fig.-  17.  A  2-D  Representation  of  Regions  of  Advantage 
Based  on  Non-optinal  Play. 


and  is  shown  in  Fig.  17,  page  58  .  It  follows  that 


rsa  c 

raa 

rsb  c 

rab 

be  conplete,  if  R^A  and 

R^g  do  not  fill  R  (RAA  U  R^g  f  R) 

then  x  c  (R^  u  R^g)  *  no  advantage  to  either. 

The  second  part  of  the  role  deternination  is  to  decide 
what  the  objective  should  be.  For  exaaple,  if  B  is  in  A's 
region  of  advantage,  R^,  should  he  play  optimally  for  the 
temporary  respite  of  a  draw,  or  should  he  allow  hiaself  to  get 
close  to  A’s  success  region,  Rci,  in  order  to  put  hiaself  ir.  a 
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good  position.  Conversely,  should  A  continue  to  make  the 

i 

most  of  his  advantage  and  olav  towards  a  kill.  Both  may 

I 

decide,  if  able,  to  escape,  Clearly,  this  nroblem  requires 
other  game  formulations. 

Regions  of  Advantage  and  the  Pi snersal  Surface.  The 
game  formulation  gives  trajectories  and  surfaces  backwards 
from  an  isocost  d^  on  the  TS,  hence  regions  of  success 
and  advantage  (as  defined  above)  should  emerge.  Looking  at 
the  game  in  this  way,  disnersal  surfaces  do  not  terminate 
trajectories,  although  they  still  are  used  to  en.iose  regions 
of  success  and  advantage.  Consider  a  2-D  example  shown  in 
Fig.  6  (page  29  ),  with  a  OS  point  Xpg  on  a  trajectory 
leaving  the  TS  from  Continue  along  the  trajectory  from 

XDS  to  Y.  For  the  game  as  formulated  the  path  YX^  is 
strictly  non-optimal  because  YZ  results  in  a  better  value 

i 

(d2)  for  B.  Thus  point  Y  lies  in  the  region  R,  and  XDS  1 

defines  a  point  on  the  boundary  of  A's  succcss/capture 

region,  R,.^.  However,  if  B  accepts  the  outcoi  e  d  <  dA>  then  ' 

YX  may  be  used  to  indicate  A's  regions  of  advantage  R^. 

Results  and  Role  Determination 

As  suggested  in  Chapter  V,  it  is  likely  that  PS  points 
Aj  and  Bj  of  Figs.  9  and  13  are  on  the  surface  that  encloses 
the  success  regions  RSA  and  Rgg.  Thus,  as  expected,  the 
capture  regions  themselves  are  relatively  small  (Ref  6:46). 

Trajectories  extending  beyond  these  points  show  a  general 
pattern;  virtually  all  of  B’s  successful  trajectories  are 
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forward  of  him,  are  within  his  weapon  range  on  either  side, 
and  are  overshoot  scissors  trajectories.  This  general  pattern 
is  a  reasonably  valid  result  for  a  slower  more  maneuverable 
machine.  On  the  other  hand,  A's  successful  trajectories 
cover  a  larger  part  of  the  x-v  plane,  are  not  so  often  scissors, 
and  include  singular  "catch-up"  arcs.  Here  again,  the  general 
pattern  is  consistent  with  the  faster  aircraft.  Furthermore, 
the  trajectories  show  that  A  has  a  larger  region  of  advantage 
than  B.  Intersection  of  RAA  and  RAg  also  seems  likely  to 
occur  forward  of  A's  terminal  area. 

Consider  again  the  OS  point  (Fig.  9,  page  38  );  if 
B  decides  to  avoid,,  one  of  the  ways  he  could  do  so  would 
be  to  remain  on  a  collision  course  and  accept  a  neutral  ♦ 

(i.e.  play  optimally  in  the  hone,,h»t  A  will  do  likewise).  How¬ 
ever,  if  he  decides  this  is  not  good  enough,  and  tries  to 
maneuver  into  an  attacking  position,  he  may  first  be  obliged 
to  accept  a  close  shave  from  A  by  entering,  and  holding  a 
turn  along  a  trajectory  terminating  in  $A.  Then  A,  having 
missed  (hopefully)  and  overshot,  gives  B  his  opportunity. 

Whether  this  maneuver  is  the  best  for  B  is  uncertain,  what 
can  be  said  however,  is  that  as  B^  lies  in  the  region  R,  and 
on  a  $A  trajectory,  it  is  a  point  in  the  surface  of  A's 
region  of  advantage  RAA>  The  indication  at  Bj  is  that  that 
point  is  in  RAA  for  all  angles  off,  a. 

A  different  situation  arises  at  point  Bg  (Fig.  9,  page  38 ) 
here  B  is  overshooting  A  and  is  turning  outwards.  This  seems 
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contrary  to  experience  because  he  is  so  close  to  the  TS  that 
by  turning  in,  it  apnears  he  could  easily  achieve  a  draw  out- 
cone  of  <}>  >  <) A,  and  get  into  an  advantageous  position.  The 
implication  is  that  tnis  part  of  the  trajectory  lies  in  R, 
and  indeed  on  the  surface  of  RAA--  However,  here  the  region 
of  B's  advantage  RAg,  extending  forward  from  B's  success 
region,  is  very  close.  Thus  B,  by  turning  in  at  is  likely 
to  move  into  RAg  a  lot  quicker,  and  certainly  with  less  dan¬ 
ger,  than  following  the  trajectory  that  results  in  $>A>  and 
accepting  a  close  shave. 

One  more  example  further  illustrates  that,  although 
trajectories  in  R  are  useful  to  locate  regions  of  advant¬ 
age,  they  are  not  necessarily  the  best  strategies  for  either 
of  the  players.  Their  strategies  will  result  from  the  role 
decision  problem.  Consider  a  scissors  maneuver  in  which  A 
is  overshooting  (Fig.  13  page  42  ),  ’t  is  likely  that  most 
of  the  trajectory  is  in  R,  and  defines  RAB;  however,  as  such, 

A  is  not  obliged  to  accept  the  scissors  as  being  the  best 
way  of  regaining  the  advantage  (although  he  may  be  so  obliged 
because  of  limited  information  etc  ).  Indeed  some  temporary 
draw  outcome  (created  by  say,  a  singular  dash)  could  possibly 
get  A  into  his  region  of  advantage  R^A  faster,  and  in  less 
danger,  than  continuing  to  B's  success  region  R_„,  and  finally 
$g,.  This  trajectory  is  a  "degenerate  scissors"  in  the  sense 
that  B's  movement  about  A's  longitudinal  axis  is  small.  Further 
investigation  may  reveal  a  singular  arc  here.  Thus  the 
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results  show  that: 

1.  Using  trajectories  beyond  capture  regions  is 
useful  in  establishing  regions  of  advantage  on 
which  role  decisions  can  be  based. 

2.  These  trajectories  are  not  necessarily  the  best 

way  to  move  from  one  region  of  advantage  to  another. 

A  full  solution  to  the  game  is  needed  to  confirn  the  prospect 
in  subpara  1.  and  throw  further  light  on  the  problem  in  sub¬ 
para  2. 

Relation  to  the  Classical  Gane 

It  is  interesting  to  relate  the  results  of  this  game 
to  the  classic  game  formulation  and  resulting  barriers  (Ref  8). 
Using  Lynch's  work  on  barrier  closure  (Ref  9:106)  it  was 
found  that,  with  the  parameters  used  in  Chapter  V  for  detailed 
analysis,  barrier  closure  did  not  occur.  Thus  B  has  no 
chance  of  escape  (for  0  <  tf  <  •»,  and  capture  defined  as  B 
simply  crossing  the  TS),  and  conversely  A  can  always  do  so 
(at  least  when  outside  the  TS) . 

Thus  the  geneial  pattern  of  trajectories  and  areas  of 
edvantage  found  using  the  Lynch  payoff  without  time,  agrees 
with  the  classical  formulation.  However,  the  introduction 
of  a  payoff  that  reflects  a  desire  to  remain  in  •'ombat  rather 
than  escape  appears  to  have  created  capture  regions  frr  both 
A  and  B  and  altogether  made  the  aodel  richer  and  mori:  realistic. 

Summary 

The  game  of  kind  formulation  from  a  terminal  payoff 
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car  be  used  to  determine  capture  regions  and  to  indicate 
regions  of  advantage.  For  the  capture  regions,  role  is 
determined.  The  regions  of  advantage  are  useful  for  making 
role  decisions,  but  do  not  indicate  the  actual  strategy  to 
use.  Results  so  far  confirm  this,  but  the  complete 
solution  to  the  game  is  reauired.-  The  classic  game  gives 
the  sane  basic  results  in  a  degenerate  form. 
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VII.  Conclusions  and  Recommendations 


Conclusions 

The  primary  objective  of  this  thesis  was  to  study 
the  application  and  validity  of  the  generalized  Lynch  pay¬ 
off  to  the  problem  of  role  determination  in  a  free  time, 
zero  sum,  perfect  information  differential  game  model  of  a 
two  aircraft  combat.  The  secondary  objective  was  to  establish 
techniques  for  using  the  payoff  in  simple  dynamic  models. 

A  constant  speed,  horizontally  planar  dynamic  model 
was  used,  and  time  was  dropped  from  Lynch's  payoff.  This 
resulted  in  valid  controls  at  the  end  of  the  game,  and 
closed  form  solutions  were  derived  for  the  terminal  co-states, 
for  the  states  and  co-states  along  constant  control  arcs, 
and  for  switching  and  singular  surfaces  at  junctions.  These 
solutions  were  then  used  directly  and  indirectly  to  partition 
the  game  space  outward  from  the  terminal  surface.  This  was 
not  fully  achieved,  though  isolated  singular  arcs  were  ident¬ 
ified  and  confirmed  for  tnis  type  of  game,  and  their  location 
and  termination  in  relation  to  various  payoffs  was  found. 

Some  dispersal  points  were  located,  which  gave  an  indication 
of  partitioning  and  capture  regions  for  each  combatant.  A 
strong  boost  to  the  validity  of  this  approach  are  trajectories 
that  correspond  to  practical  scissors  maneuvers.  Unfortunately, 
much  of  the  length  of  these  trajectories  are  likely  to  be 
non-optimal  for  the  game  as  formulated;  however,  a  technique 
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was  proposed,  using  these  trajectories  and  their  intersecting 
surfaces,  to  estimate  regions  of  advantage  for  each  player.  A 
comparison  with  Isaacs'  classical  barrier  formulation  agrees 
broadly  with  the  general  pattern  of  trajectories  produced 
by  this  formulation. 

Hence  the  thesis  has  achieved  both  objectives  to 
some  extent,  and  enables  a  limited  main  conclusion  that  the 
Lynch  payoff,  in  its  terminal  form,  shows  some  promise 
for  producing  solutions  for  simple  dynamic  models  that  bear 
some  resemblance  to  actual  combat.  Furthermore,  the  payoff 
can  handle  a  wide  range  of  weapon  systems. 

Recommendations 

This  thesis  should  definitely  not  be  regarded  as 
approaching  a  reasonably  valid  differential  game  model  of 
aerial  combat.  Not  only  must  the  solution  in  the  large 
be  fully  completed,  far  too  many  important  factors  have  had 
to  be  assumed  away,  in  order  to  make  some  initial  progress.; 

Consequently,  there  are  four  main  recommendations 
for  further  investigation: 

1.  The  complete  partitioning  of  the  game  space 
and  full  investigation  of  the  use  of  advantage 
regions  in  role  determination.  This  will  involve 
refining  techniques  for  the  location  of  the  game 
surfaces. 

2.  The  effect  of  including  time  in  the  payoff  and  the 
interaction  with  other  gane  formulations,  in 
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particular  a  non-zero  sun  formulation. 

3.  A  full  parameter  investigation  of  the  solution 
behaviour. 

4.  The  extension  of  the  game  model  to  include  factors 
such  as  restricted  information,  and  the  extension 
of  the  dynamic  model  to  include  variable  velocity 
and  altitude. 
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If  u*  and  v*  can  be  found,  the  pair  (u° ,  v*)  is  called 
a  saddle  point  of  the  game,  and  (u*  v°)  is  called  the 


Necessary  Conditions  for  a  Solution 

The  existence  of  a  solution  depends  on  the  commutative 
condition  that 


min  max  J  (u,  v)  *  max  nin  J  (u,  v) 


A  necessary  condition  for  a  saddle  point  is  that 
the  Hamiltonian  (It)  defined  as 


:  o 


H  (x,  A^  u ,  v,  t)  »  X1  £  ♦  L 


must  be  minimized  over  the  admissable  set  of  u,  and 
maximized  over  the  admissable  set  of  v,  such  that 
H*  »  min  max  II  »  max  rain  H 


giving,  if  there  are  no  control  constraints 

II  »  0  and  II  «  0  ( 

u  v 

The  commutative  condition  of  Eqs  (A-5)  and  (A-7) 
is  satisfied  if  H  is  separaDle  for  u  and  v.  If  u  anJ  v 
appear  linearly  in  II  and  are  constrained  by 


| u I  <  u  and  1 v I  <  v 
1  1  -  max  1  •  -  »a 


with  switching  functions  $u  and  Sy  defined  as 


S  (x,  A)  •  II  and  S  (x,  A)  »  II 
u  -  -  u  v  -  -  v 
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The  saddle  point  controls  are  Riven  by 


u  -  if  s„  <  0 
max  u 


-u  if  S  >  0 
max  u 


v  if  S  >  0 
nax  v 


v*  =  -v  if  S  <  0 
nax  v 


CA-  11) 


Where  there  are  no  state  dependent  control  constraints,  the 
n  -  dimensional  co-state  vector  X  is  given  by 


-H 


(A- 12) 


subject  to  the  transversality  conditions 


A(tf)  -  6  (tf)  ♦  v  *  (tr)  (A- 1 3) 

H(tf)  -  *t  (tf)  ♦  V  £t  (tf)  (A  — 14) 

where  v  is  a  constant  nultiplier. 

If  time  does  not  appear  explicitly  in  either  11 
or  the  control  constraints,  then  II  is  constant.  Also,  if 
the  problem  is  one  of  free  final  time>  then  Eq  (A- 14) 
implies  that 

H(tf)  -  II  (t)  »  0  (A-1S) 

If  u  and  v  are  expressed  as  functions  of  time  and 
the  initial  states,  then  this  is  an  open  loop  solution. 

If  they  are  expressed  as  functions  of  time  and  the  instan¬ 
taneous  states,  then  the  solution  constitutes  a  closed 
loop  law. 


I, 

i 

I 
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Singular  Controls  and  Surfaces 

When  the  controls  u  and  v  apoear  linearly  in  II 
the  possibility  of  solution  arcs  with  singular  controls 
exists.  Using  the  switching  functions  defined  in 
Eq  (A-10),  II  nay  be  arranged  as 


H  =  XT  g  (x)  <■  S  u  -  S 

~  *  —  I!  V 


(A- 16) 


Taking  the  cas.e  where  S  «  0,  H  then  bccones 
u 

independent  of  u  and  thus  minimi  tat > on  with  respect  to 
u  is  not  possible.  Similarity  for  v,  if  S^  «  0.  However, 
a  necessary  condition  for  a  solution  on  a  singular  arc  is 


sa  (£.  X)  *  Su  (x,  X)  -  -  -  Su 


2q-l 


(X,  X)  »  0  (A- 17) 


where  successive  differentiation  yields  a  function 
S  (x,  X,  u)  *  0 


(A- 1 8) 


i 


o 


which  is  explicit  in  u  and  nay  be  solved  to  yield  a 
candidate  for  u*  on  the  singular  arc.;  Thi*  aust  satisfy 
the  generalized  legendre-Clebsch  condition 


£  (!"1  1 

ctions  wit) 

£(0  I 

aust  also  be  satisfied. 


and  at  Junctions  with  nonsingular  arcs,  the  condition 


<  0 

junction  tine 


(A- ID) 


(A-20) 
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Equation  (A - 17}  gives  a  singular  surface  of  dincnsion 
2n  -  2q  in  (x-X)  space;  with  the  additional  constraint 
of  Eq  (A-15),  in  a  free  time  problem  the  dimension  becomes 
2n  -  2q  -  1 If  n  *  3  and  2  =  1  this  yields  a  sirgular 
surface  in  the  state  space  on  which  all  singular  arcs  must 
lie.  Hence  intersection  with  this  surface  is  a  necessaryi 
but  not  sufficient  condition  for  switching  to  a  singular 
arc.  Another  important  property  of  singular  arcs  is  that 
they  can  be  joined  by  an  infinity  of  paths. 

Switching  Surface 

A  switching  surface  consists  of  points  at  which  the 
controls  are  discontinuous.  From  Eqs  (A-ll)  and  C A - 17) 
the  controls  switch  only  when 

S  =0  (for  u) ,  and  S  =0  (for  v)  (A-21) 

u  v 

This  condition  defines  a  switching  surface  in  (x-X)  space 
and  implies  that  a  singular  surface  is  a  particular  type 
of  switching  surface,  also  satisfying  Eq  f A- 1 7 ) . 

The  Barriers  and  Games  of  Kind  and  Degree 

Isaacs  (Ref  8:35)  classisfies  games  into  those  of 
"kind”, and  those  of  "degree".  In  the  game  of  kind,  the 
primary  interest  is  whether  or  not  termination  (as  defined 
by  the  game)  occurs.  For  a  game  of  degree  however,  term¬ 
ination  is  assumed  to  occur,  and  the  players'  objectives 
are  to  hasten  or  delay  termination,  or  to  minimax  a 


0 


o 


continuous  payoff. 

There  is  not  a  clear  distinction  between  these  classes 
of  games  (Ref  8:230);  a  game  of  kind  can  be  regarded  as  one 
of  degree  in  which  bounds  are  placed  on  the  terminal  pay¬ 
off.  Termination  of  the  game  is  then  defined  as  occurring 
only  when  the  constraints  are  satisfied  and  the  payoff  is 
within  the  specified  bounds.  In  the  usual  classical  form¬ 
ulation  (Ref  8),  the  terminal  payoff,  in  a  game  of  degree, 

is  the  final  tine  (tf),  and  the  terminal  constraint  is  the 
2  2  2 

circular  region  x  -  y  «  L  .  The  game  becomes  a  game  of 
kind  when  the  bounds  0  <  tf  <  *  are  placed  on  the  payoff, 
and  the  only  acceptable  outcome  must  be  within  chess  bounds. 
Hence  the  usual  classical  formulation  of  a  game  considers  the 
possibility  of  the  terninal  constraint  ever  being  satis¬ 
fied.,  Carnes  of  degree  with  terminal  payoffs  can,  in  a 
similar  way,  be  treated  as  games  of  kind  (Ref  6). 

For  a  game  of  kind  there  may  be  a  surface  in  the  game 
called  a  "barrier".  In  the  classical  formulation  this 
surface  consists  of  those  starting  points  which  end  at  the 
limit  of  all  possible  terminations  (BUP).,  In  the  more  gen¬ 
eral  sense,  a  barrier  consists  of  starting  points  which 
terminate  on  the  bounds  set  by  the  terminal  payoff.  If 
the  barrier  encloses  entirely  some  region  of  the  game  space 
then  the  space  is  divided  into  distinct  regions  of  success 
and  failure  (Ref  3:66). 
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where  (£pS)  represents  the  value  of  the  payoff  along  the 
K**1  possible  path  from  a  single  point  Xpg  on  the  dispersal 
surface.  Figure  6  illustrates  this  for  a  two  dimensional 
problem  with  K  «  2.  Intersection  with  the  dispersal  sur¬ 
face  thus  represents  a  decision  point  at  which  one,  or 
sometimes  both  players  say  be  able  to  choose  strategies, 
knowing  that  in  either  case  the  payoff  will  be  the  same 
regardless  of  the  choice..  In  representing  the  limit  of 
optimality  of  a  trajectory,  it  is  useful  for  terminating 
arcs  and  bounding  regions. 
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Appendix  B 


Derivation  of  Closed  Forn  Solutions 


Purpose 

The  purpose  of  this  appendix  is  to  show  the  derivation 
of  closed  forn  solutions  to  the  game  fornulated  in  Chapter  III. 


Terninal  Co-states  A  and  A 
- Xf  Vf 

The  terninal  co-states  A  and  A  are  given  by 

*f  Yf 

X  =  $  X  =  $ 

Xf  xf  Yf  Yf 

where  4  =  -  (a  cos  (2f-ef)  *  b  cos  ®f^  * 

,2  2  ,2 

v(xf  ♦  Vf  -  L  ) 

(B-l 

or  4  cos  (zf-8.)  »  b  cos  8f)  ♦ 

.2.2, 

vvrf-L  ) 

To  avoid  expressing  4  in  cartesian  co-ordinates  and  taking 
partial  derivatives  directlv,  the  chain  rule  can  be  used 


giving 


*e  *  Va  *  Va 


L(A  cos  0,  -  A  sin  A,} 


f 

(A  cot  9f  -  — j—  coscc  9f ; 
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The  min.  maxD  ll(tf)  »  0  condition  Rives 
A  D 


Xxf  lVB  C0S  Zf  '  VA  *  °Ayf]  * 

Xy£  [VB  Sin  Zf  '  Vf1  *  Xz£  [0B  '  °A>  “  0 


Substituting 

xef  '  $o£  =  \  (b  ,in  ef  '  a  sin  Czf  '  9f” 

ard  \z  =  ■  j  a  sin  (z£  -  0£) 


(B-4 ) 


(B-S) 


and  Lq  (B-3)  into  Eq  (B-4)  gives  an  equation  in  terns  of 

A  anc  the  final  states  which  can  he  solved  to  give 
yf 

A  U*W  (B-6) 

yf  *  2V 

where  U  *  sin  9£  (b  sin  9£<ja  -  a  si"  (*f  -  0£)  3g) 

(VB  cos  z,  -  VA) 

K  *  (b  sin  0£  -  a  sin  (z£-9£)) - - - 


V  *  (Vg  cos  (z£-9£)  -  VA  cos  0£) 


Substitution  for  X.  in  Eq  (B-3)  now  gives  A„  in  terns  of 
0f  xf 

X  and  the  final  states  as 
yf 


Ax  *  [2L  Xy  cos  0£  -  (b  sin  0£  -  a  sin  (zf-9f))] 


2L  sin  9. 


(8-7> 


State  and  Co-state  Solutions 

Over  an  arc  of  constant  controls  oA  and  Og,  the  state 
and  co-state  equations  in  cartesian  co-ordinates  are 
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I 


o 


x  «  V„  cos  z  -  V,  ♦  o.y  with  x(tf)  =  x, 
BAA  r 


(B-8) 


y  «  Vg  sin  i  -  oftx 


with  y(tf)  =  y. 


o.  -  o. 
B  A 


with  z(tf)  =  z. 


and 


x;  -  oA  y 


with  A  (tf)  ■  A 


A  *'  =  -o.x 

y  A 


with  A  (tf)  =  A 


(B-9: 


A^  *  vg(Ax  sin  z  -  A^  cos  z)  with  A^ftf)  »  A^ 


Nonsingular  Solutions .  For  nonsingular  arcs  with 


o  ]f  o  and  oD  <*  0,  three  solutions  are  obtained  immediately 
A  B 


z(t)  »  (0g-0A)  (t-tf)  ♦  zf 
AT(t)  »  A' sin  Tft  *  B’cos  O^t 

’A 


Ay(t)  ■=  -B*  sin  oAt  ♦  A’cos  oAt 


(B-H 

(B-l 

(B-l 


where 


A'«  A  sin  o.tf  ♦  A„  cos  o.tf 
*f  A  yf  A 


and 


B’«  A  cos  o.tf  -  A  sin  o.tf 

Xf  A  Ye  a 


‘f  "  'f 

Letting  tf  *  0,  so  that  along  an  arc  t  <  0,  Eqs  (B-10)  to 
(B-12)  become 

z(t)  »  (ob-oa)  t  ♦  if 


A„(t)  *  x„_  sin  ait  *  xX{  cos  °At 


X'"  yf  ’  A  x, 


(B-l 

(B-l 


A..(t)  «  -A.  sin  o.t  ♦  A„  cos  o.t 


(B-l 
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Substituting  in  Eq  (B-9)  for  z,  A^  and  gives 

A-  »  V  [A  sin  (0Rt  +  zf)  -  A  cos  (o  t  ♦  z{)  1  (B-16) 

Z  B  X£  B  t  >  f  °  1 

and  solving  with  A  (tf  =  0)  =  A  gives 
z  if 

“  \{  +  RBlXyf  sin  2f  *  Xxf  cos  2f]  ' 

R,j  [A  si»  «V  *  if)  *  Xxf  cos  {0Bt  *  2f)] 

1  (B- 1 7; 

or  a  t  aBt 

Xz(t)  “  Xzf  *  2  RB  sin  ~T  [Xy£  cos  *  2f>  * 

°Bt 

A  sin  (—5—  ♦  if) 3 
f 

where  RB  -  Vg/Og 

Finally,  for  the  states  x(t)  and  y(t),  a  second  order  BE 
can  be  forned  from  Eq  (B-8)  giving 
2  *  X  ■  V2  °A  -  °B>  Sin  2 
Substituting  in  the  solution  for  z(t)  gives 

*  ♦  °A  x  -  V2  °A  -  °B>Sin  l(0B  -  °A>  2  *  2fJ 

with  x(tf  *  0)  «  x£ 

The  solution  is  now  straightforward,  yielding 

x (t }  «  x £  cos  0At  ♦  y£  sin  oftt  ♦  RR  [sin  (oAt  -  zf)  ♦ 

(B-18) 

sin  ((oB  -  oA>  t  ♦  z£)]  -  Ra  sin  at 
Differentiation,  and  substitution  for  x  in  Eq  (B-18)  yields 
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y(t)  *  -xf  sin  oAt  ♦  yf  cos  oAt  ♦  Rg  [cos  (oAt  -  2f)  - 

(11-19) 

COS  ((0B  -  0A)  t  ♦  2f}]  ■»  RA  (1  -  COS  0At) 

Thus  Eqs  (8-18),  (B-19),  (B-1J),  (D-14),  (B-1S)  and  (B-16) 
are  the  nonsingular  arc  closed  form  solutions  with  tf  =  0. 

Singular  Solutions.  Because  RA  and  Rg  are  infinite  for 
singular  controls,  the  solutions  for  nonsingular  arcs 
cannot  be  applied  directly.  For  an  A  singular  arc,  sub¬ 
stitution  for  oA  *  0  in  Eqs  ( B - 1 3 ) ,  (B-14),  (B-15)  and 
(B-17),  and  application  of  the  singular  conditions  A^  -  0 
yields 


t(t) 
Ax(t) 
A  (t) 

y 

Az(t) 
Solving  the 


-  zf  ♦  oAt 


»  A  .0 

yf 

«Rt  °Rt 

«  A  -  2  R.  A  sin  — sin  (-7—  ♦  z~) 
zf  B  xf  2  2  f 

equations  of  notion  directly  with  oA  *  0 


(B-20) 
(  B  —  2 1 ) 
(B-22) 

{  B  —  2  3 ) 
gives 


x(t)  *  xf  -  VAt  ♦  Rg  (sin  ( 2 £  ♦  Ogt)  -  sin  2f)  (B-24) 

y(t)  c  yf  -  Rg  (cos  (Cgt  ♦  zf)  -  cos  zf)  (B-25) 

For  a  B  singular  arc,  substitution  of  o8  ’  0  into  Eq  ( B - 1 3 ) 
with  the  singular  condition  A£  »  0  gives 

z(t)  *  zf  -  oAt  (B-26) 


A  ft)  -  A_  «  0 


(B-27) 
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and  *x(t)  *  (t)  are  still  given  by  Eqs  (B-14)  and 

(B-15).  Solving  the  equations  of  motion  directly  with 
o„  =■  0  gives 

x(t)  =  xf  cos  0At  ♦  (yf  -  Ra)  sin  oAt  ♦ 

VBt  cos  (zf  -  oAt) 

y(t)  =  -xf  sin  oAt  ♦  (yf  -  RA)  cos  oAt  ♦ 


V  t  sin  (z  -  at) 
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